Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
نویسندگان
چکیده
منابع مشابه
Acoustic factor analysis based universal background model for robust speaker verification in noise
The Universal Background Model (UBM) is known as a speaker independent Gaussian Mixture Model (GMM) trained on a large speech corpus containing many speakers’ recordings in various conditions. When noisy test data is involved, UBM trained on clean data is generally not optimal. Using noisy data for UBM training, however, creates a bias towards the specific development noise samples resulting in...
متن کاملExperiments in speaker verification using factor analysis likelihood ratios
We report the results of some speaker verification experiments on the NIST 1999 and NIST 2000 test sets using factor analysis likelihood ratio statistics. For the experiments on the 1999 test set we had to use a mismatched training set, namely Phases 1 and 2 of the Switchboard II corpus, to train the factor analysis model. Our results on this test set are are comparable to (but not better than)...
متن کاملRobust Speaker Verification in Colored Noise Environment
Absfrm-Noise robustness of automatic speaker verification systems is crucial in real life applications. A study on the performance of several spectral subtraction-based speech enhancement techniques shows the poor performance of these algorithms when used as a preprocessing stage of the speaker verification system in the presence of colored noise. In this paper, a new technique based on the add...
متن کاملMaximum Likelihood i-vector Space Using PCA for Speaker Verification
This paper proposes a new approach to training the i-vector space using a variant of PCA with the Baum-Welch statistics for speaker verification. In eigenvoice the rank of variability space is bounded by the number of training speakers, so a variant of the probabilistic PCA approach is introduced for estimating the parameters. But this constraint doesn’t exist in i-vector model because the numb...
متن کاملEnvironment adaptation for robust speaker verification by cascading maximum likelihood linear regression and reinforced learning
In speaker verification over public telephone networks, utterances can be obtained from different types of handsets. Different handsets may introduce different degrees of distortion to the speech signals. This paper attempts to combine a handset selector with (1) handset-specific transformations, (2) reinforced learning, and (3) stochastic feature transformation to reduce the effect caused by t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM Transactions on Audio, Speech, and Language Processing
سال: 2014
ISSN: 2329-9290,2329-9304
DOI: 10.1109/taslp.2013.2292356